Fine Grained Classification of Named Entities
نویسندگان
چکیده
While Named Entity extraction is useful in many natural language applications, the coarse categories that most NE extractors work with prove insufficient for complex applications such as Question Answering and Ontology generation. We examine one coarse category of named entities, persons, and describe a method for automatically classifying person instances into eight finergrained subcategories. We present a supervised learning method that considers the local context surrounding the entity as well as more global semantic information derived from topic signatures and WordNet. We reinforce this method with an algorithm that takes advantage of the presence of entities in multiple contexts.
منابع مشابه
Fine Grained Classification of Named Entities In Wikipedia
Fine Grained Classification of Named Entities In Wikipedia Maksim Tkachenko, Alexander Ulanov, Andrey Simanovsky
متن کاملFine-Grained Classification of Named Entities by Fusing Multi-Features
Due to the increase in the number of classes and the decrease in the semantic differences between classes, fine-grained classification of Named Entities is a more difficult task than classic classification of NEs. Using only simple local context features for this fine-grained task cannot yield a good classification performance. This paper proposes a method exploiting Multi-features for fine-gra...
متن کاملClassifying Articles in Chinese Wikipedia with Fine-Grained Named Entity Types
Named entity classification of Wikipedia articles is a fundamental research area that can be used to automatically build large-scale corpora of named entity recognition or to support other entity processing, such as entity linking, as auxiliary tasks. This paper describes a method of classifying named entities in Chinese Wikipedia with fine-grained types. We considered multi-faceted information...
متن کاملFine-Grained Classification of Named Entities Exploiting Latent Semantic Kernels
We present a kernel-based approach for finegrained classification of named entities. The only training data for our algorithm is a few manually annotated entities for each class. We defined kernel functions that implicitly map entities, represented by aggregating all contexts in which they occur, into a latent semantic space derived from Wikipedia. Our method achieves a significant improvement ...
متن کاملLow-Complexity Heuristics for Deriving Fine-Grained Classes of Named Entities from Web Textual Data
We introduce a low-complexity method for acquiring fine-grained classes of named entities from the Web. The method exploits the large amounts of textual data available on the Web, while avoiding the use of any expensive text processing techniques or tools. The quality of the extracted classes is encouraging with respect to both the precision of the sets of named entities acquired within various...
متن کاملFine-Grained Named Entity Recognition Using Conditional Random Fields for Question Answering
In many QA systems, fine-grained named entities are extracted by coarse-grained named entity recognizer and fine-grained named entity dictionary. In this paper, we describe a fine-grained Named Entity Recognition using Conditional Random Fields (CRFs) for question answering. We used CRFs to detect boundary of named entities and Maximum Entropy (ME) to classify named entity classes. Using the pr...
متن کامل